Neural Networks AI News & Research

🎓 News MIT Technology Review — AI 2 min read

LLMs+

When ChatGPT launched as an experimental prototype in late 2022, OpenAI’s chatbot became an everyday everything app for hundreds of millions of people. LLMs like ChatGPT were the new future: The entire tech industry was consumed by the inferno, with companies racing to spin up rival products. The ashes of the old tech world still…

#Large Language Models #AI Efficiency #Neural Networks

🕐 21 days ago

Read →

🐻 Research Berkeley AI Research 6 min read

What exactly does word2vec learn?

What exactly does word2vec learn, and how? Answering this question amounts to understanding representation learning in a minimal yet interesting language modeling task. Despite the fact that word2vec is a…

#machine learning #word embeddings #representation learning

🕐 8 months ago

Read →

🤖 Business TOPBOTS 1 min read

How Do LLMs Think? 5 Approaches Powering the Next Generation of AI Reasoning

Large Language Models (LLMs) have come a long way since their early days of mimicking autocomplete on steroids. But generating fluent text isn’t enough – true intelligence demands reasoning. That…

#LLMs #AI Reasoning #Machine Learning

🕐 1 year, 1 month ago

Read →

📐 Research The Gradient 14 min read

Car-GPT: Could LLMs finally make self-driving cars happen?

Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?

#autonomous vehicles #large language models #machine learning

🕐 2 years ago

Read →

📐 Research The Gradient 31 min read

Neural algorithmic reasoning

In this article, we will talk about classical computation : the kind of computation typically found in an undergraduate Computer Science course on Algorithms and Data Structures [1]. Think shortest…

#neural networks #algorithms #machine learning

🕐 2 years ago

Read →

🏃 Research fast.ai 14 min read

Can LLMs learn from a single example?

Summary: recently while fine-tuning a large language model (LLM) on multiple-choice science exam questions, we observed some highly unusual training loss curves. In particular, it appeared the model was able…

#machine learning #LLMs #neural networks

🕐 2 years ago

Read →

🔬 Research Distill.pub 38 min read

Understanding Convolutions on Graphs

Understanding the building blocks and design choices of graph neural networks.

#machine learning #graph neural networks #deep learning

🕐 4 years ago

Read →

🔬 Research Distill.pub 13 min read

Weight Banding

Weights in the final layer of common visual models appear as horizontal bands. We investigate how and why.

#neural networks #machine learning #computer vision

🕐 5 years ago

Read →

🔬 Research Distill.pub 16 min read

Branch Specialization

When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.

#neural networks #interpretability #circuit analysis

🕐 5 years ago

Read →

🔬 Research Distill.pub 9 min read

Multimodal Neurons in Artificial Neural Networks

We report the existence of multimodal neurons in artificial neural networks, similar to those found in the human brain.

#neural networks #interpretability #multimodal learning

🕐 5 years ago

Read →

🔬 Research Distill.pub 32 min read

Self-Organising Textures

Neural Cellular Automata learn to generate textures, exhibiting surprising properties.

#neural networks #cellular automata #texture synthesis

🕐 5 years ago

Read →

🔬 Research Distill.pub 16 min read

Visualizing Weights

We present techniques for visualizing, contextualizing, and understanding neural network weights.

#neural networks #interpretability #visualization

🕐 5 years ago

Read →

🔬 Research Distill.pub 3 min read

Curve Circuits

Reverse engineering the curve detection algorithm from InceptionV1 and reimplementing it from scratch.

#neural networks #interpretability #curve detection

🕐 5 years ago

Read →

🔬 Research Distill.pub 19 min read

High-Low Frequency Detectors

A family of early-vision neurons reacting to directional transitions from high to low spatial frequency.

#neural networks #computer vision #feature detection

🕐 5 years ago

Read →

🔬 Research Distill.pub 20 min read

Naturally Occurring Equivariance in Neural Networks

Neural networks naturally learn many transformed copies of the same feature, connected by symmetric weights.

#neural networks #equivariance #symmetry

🕐 5 years ago

Read →

🔬 Research Distill.pub 36 min read

Curve Detectors

Part one of a three part deep dive into the curve neuron family.

#neural networks #interpretability #computer vision

🕐 5 years ago

Read →

🔬 Research Distill.pub 28 min read

An Overview of Early Vision in InceptionV1

An overview of all the neurons in the first five layers of InceptionV1, organized into a taxonomy of 'neuron groups.'

#neural networks #computer vision #deep learning

🕐 6 years ago

Read →

🔬 Research Distill.pub 42 min read

Visualizing Neural Networks with the Grand Tour

By focusing on linear dimensionality reduction, we show how to visualize many dynamic phenomena in neural networks.

#neural networks #visualization #interpretability

🕐 6 years ago

Read →

🔬 Research Distill.pub 5 min read

Thread: Circuits

What can we learn if we invest heavily in reverse engineering a single neural network?

#deep learning #neural networks #interpretability

🕐 6 years ago

Read →

🔬 Research Distill.pub 42 min read

Zoom In: An Introduction to Circuits

By studying the connections between neurons, we can find meaningful algorithms in the weights of neural networks.

#machine learning #neural networks #interpretability

🕐 6 years ago

Read →

🔬 Research Distill.pub 25 min read

Growing Neural Cellular Automata

Training an end-to-end differentiable, self-organising cellular automata model of morphogenesis, able to both grow and regenerate specific patterns.

#artificial life #cellular automata #morphogenesis

🕐 6 years ago

Read →

🔬 Research Distill.pub 36 min read

Visualizing the Impact of Feature Attribution Baselines

Exploring the baseline input hyperparameter, and how it impacts interpretations of neural network behavior.

#interpretability #neural networks #feature attribution

🕐 6 years ago

Read →

🔬 Research Distill.pub 9 min read

A Discussion of 'Adversarial Examples Are Not Bugs, They Are Features': Adversarial Example Researchers Need to Expand What is Meant by 'Robustness'

The main hypothesis in Ilyas et al. (2019) happens to be a special case of a more general principle that is commonly accepted in the robustness to distributional shift literature

#adversarial examples #robustness #distribution shift

🕐 6 years ago

Read →

🔬 Research Distill.pub 6 min read

A Discussion of 'Adversarial Examples Are Not Bugs, They Are Features': Robust Feature Leakage

An example project using webpack and svelte-loader and ejs to inline SVGs

#adversarial examples #robustness #machine learning

🕐 6 years ago

Read →

Neural Networks AI News & Research · DeepTrendLab

Neural Networks